Dataset statistics
| Number of variables | 16 |
|---|---|
| Number of observations | 3656 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 485.6 KiB |
| Average record size in memory | 136.0 B |
Variable types
| Categorical | 8 |
|---|---|
| Numeric | 8 |
cigsPerDay is highly overall correlated with currentSmoker | High correlation |
currentSmoker is highly overall correlated with cigsPerDay | High correlation |
diaBP is highly overall correlated with prevalentHyp and 1 other fields | High correlation |
diabetes is highly overall correlated with glucose | High correlation |
glucose is highly overall correlated with diabetes | High correlation |
prevalentHyp is highly overall correlated with diaBP and 1 other fields | High correlation |
sysBP is highly overall correlated with diaBP and 1 other fields | High correlation |
BPMeds is highly imbalanced (80.4%) | Imbalance |
prevalentStroke is highly imbalanced (94.9%) | Imbalance |
diabetes is highly imbalanced (82.0%) | Imbalance |
cigsPerDay has 1868 (51.1%) zeros | Zeros |
Reproduction
| Analysis started | 2024-11-11 05:06:25.929401 |
|---|---|
| Analysis finished | 2024-11-11 05:06:28.570885 |
| Duration | 2.64 seconds |
| Software version | ydata-profiling vv4.10.0 |
| Download configuration | config.json |
male
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 57.1 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3656 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 0 |
| 3rd row | 1 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 2034 | |
| 1 | 1622 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 2034 | |
| 1 | 1622 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2034 | |
| 1 | 1622 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3656 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2034 | |
| 1 | 1622 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3656 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 2034 | |
| 1 | 1622 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3656 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 2034 | |
| 1 | 1622 |
age
Real number (ℝ)
| Distinct | 39 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 49.55744 |
| Minimum | 32 |
|---|---|
| Maximum | 70 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 57.1 KiB |
Quantile statistics
| Minimum | 32 |
|---|---|
| 5-th percentile | 37 |
| Q1 | 42 |
| median | 49 |
| Q3 | 56 |
| 95-th percentile | 64 |
| Maximum | 70 |
| Range | 38 |
| Interquartile range (IQR) | 14 |
Descriptive statistics
| Standard deviation | 8.5611335 |
|---|---|
| Coefficient of variation (CV) | 0.17275173 |
| Kurtosis | -0.99161978 |
| Mean | 49.55744 |
| Median Absolute Deviation (MAD) | 7 |
| Skewness | 0.2311704 |
| Sum | 181182 |
| Variance | 73.293006 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 40 | 166 | 4.5% |
| 46 | 166 | 4.5% |
| 42 | 161 | 4.4% |
| 48 | 149 | 4.1% |
| 39 | 146 | 4.0% |
| 41 | 145 | 4.0% |
| 44 | 143 | 3.9% |
| 45 | 140 | 3.8% |
| 43 | 137 | 3.7% |
| 52 | 129 | 3.5% |
| Other values (29) | 2174 |
| Value | Count | Frequency (%) |
| 32 | 1 | < 0.1% |
| 33 | 5 | 0.1% |
| 34 | 14 | 0.4% |
| 35 | 33 | 0.9% |
| 36 | 77 | |
| 37 | 80 | |
| 38 | 124 | |
| 39 | 146 | |
| 40 | 166 | |
| 41 | 145 |
| Value | Count | Frequency (%) |
| 70 | 1 | < 0.1% |
| 69 | 5 | 0.1% |
| 68 | 16 | 0.4% |
| 67 | 38 | 1.0% |
| 66 | 34 | 0.9% |
| 65 | 46 | |
| 64 | 80 | |
| 63 | 96 | |
| 62 | 91 | |
| 61 | 91 |
education
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 57.1 KiB |
| 1.0 | |
|---|---|
| 2.0 | |
| 3.0 | |
| 4.0 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 10968 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 4.0 |
|---|---|
| 2nd row | 2.0 |
| 3rd row | 1.0 |
| 4th row | 3.0 |
| 5th row | 3.0 |
Common Values
| Value | Count | Frequency (%) |
| 1.0 | 1526 | |
| 2.0 | 1101 | |
| 3.0 | 606 | 16.6% |
| 4.0 | 423 | 11.6% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 1.0 | 1526 | |
| 2.0 | 1101 | |
| 3.0 | 606 | 16.6% |
| 4.0 | 423 | 11.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 3656 | |
| 0 | 3656 | |
| 1 | 1526 | |
| 2 | 1101 | 10.0% |
| 3 | 606 | 5.5% |
| 4 | 423 | 3.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 7312 | |
| Other Punctuation | 3656 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3656 | |
| 1 | 1526 | |
| 2 | 1101 | 15.1% |
| 3 | 606 | 8.3% |
| 4 | 423 | 5.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3656 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 10968 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 3656 | |
| 0 | 3656 | |
| 1 | 1526 | |
| 2 | 1101 | 10.0% |
| 3 | 606 | 5.5% |
| 4 | 423 | 3.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10968 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 3656 | |
| 0 | 3656 | |
| 1 | 1526 | |
| 2 | 1101 | 10.0% |
| 3 | 606 | 5.5% |
| 4 | 423 | 3.9% |
currentSmoker
Categorical
HIGH CORRELATION 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 57.1 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3656 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 1868 | |
| 1 | 1788 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 1868 | |
| 1 | 1788 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 1868 | |
| 1 | 1788 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3656 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1868 | |
| 1 | 1788 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3656 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1868 | |
| 1 | 1788 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3656 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 1868 | |
| 1 | 1788 |
cigsPerDay
Real number (ℝ)
HIGH CORRELATION  ZEROS 
| Distinct | 33 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9.0221554 |
| Minimum | 0 |
|---|---|
| Maximum | 70 |
| Zeros | 1868 |
| Zeros (%) | 51.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 57.1 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 20 |
| 95-th percentile | 30 |
| Maximum | 70 |
| Range | 70 |
| Interquartile range (IQR) | 20 |
Descriptive statistics
| Standard deviation | 11.918869 |
|---|---|
| Coefficient of variation (CV) | 1.3210666 |
| Kurtosis | 0.9618248 |
| Mean | 9.0221554 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.2298317 |
| Sum | 32985 |
| Variance | 142.05943 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1868 | |
| 20 | 651 | 17.8% |
| 30 | 191 | 5.2% |
| 15 | 184 | 5.0% |
| 10 | 123 | 3.4% |
| 5 | 99 | 2.7% |
| 9 | 99 | 2.7% |
| 3 | 83 | 2.3% |
| 40 | 69 | 1.9% |
| 1 | 61 | 1.7% |
| Other values (23) | 228 | 6.2% |
| Value | Count | Frequency (%) |
| 0 | 1868 | |
| 1 | 61 | 1.7% |
| 2 | 16 | 0.4% |
| 3 | 83 | 2.3% |
| 4 | 8 | 0.2% |
| 5 | 99 | 2.7% |
| 6 | 17 | 0.5% |
| 7 | 11 | 0.3% |
| 8 | 9 | 0.2% |
| 9 | 99 | 2.7% |
| Value | Count | Frequency (%) |
| 70 | 1 | < 0.1% |
| 60 | 9 | 0.2% |
| 50 | 4 | 0.1% |
| 45 | 3 | 0.1% |
| 43 | 49 | 1.3% |
| 40 | 69 | 1.9% |
| 38 | 1 | < 0.1% |
| 35 | 19 | 0.5% |
| 30 | 191 | |
| 29 | 1 | < 0.1% |
BPMeds
Categorical
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 57.1 KiB |
| 0.0 | |
|---|---|
| 1.0 | 111 |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Characters and Unicode
| Total characters | 10968 |
|---|---|
| Distinct characters | 3 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 0.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 0.0 |
Common Values
| Value | Count | Frequency (%) |
| 0.0 | 3545 | |
| 1.0 | 111 | 3.0% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0.0 | 3545 | |
| 1.0 | 111 | 3.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 7201 | |
| . | 3656 | |
| 1 | 111 | 1.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 7312 | |
| Other Punctuation | 3656 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 7201 | |
| 1 | 111 | 1.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3656 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 10968 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 7201 | |
| . | 3656 | |
| 1 | 111 | 1.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10968 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 7201 | |
| . | 3656 | |
| 1 | 111 | 1.0% |
prevalentStroke
Categorical
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 57.1 KiB |
| 0 | |
|---|---|
| 1 | 21 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3656 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 3635 | |
| 1 | 21 | 0.6% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 3635 | |
| 1 | 21 | 0.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3635 | |
| 1 | 21 | 0.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3656 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3635 | |
| 1 | 21 | 0.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3656 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 3635 | |
| 1 | 21 | 0.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3656 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3635 | |
| 1 | 21 | 0.6% |
prevalentHyp
Categorical
HIGH CORRELATION 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 57.1 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3656 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 1 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 2517 | |
| 1 | 1139 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 2517 | |
| 1 | 1139 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2517 | |
| 1 | 1139 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3656 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2517 | |
| 1 | 1139 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3656 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 2517 | |
| 1 | 1139 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3656 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 2517 | |
| 1 | 1139 |
diabetes
Categorical
HIGH CORRELATION  IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 57.1 KiB |
| 0 | |
|---|---|
| 1 | 99 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3656 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 3557 | |
| 1 | 99 | 2.7% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 3557 | |
| 1 | 99 | 2.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3557 | |
| 1 | 99 | 2.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3656 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3557 | |
| 1 | 99 | 2.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3656 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 3557 | |
| 1 | 99 | 2.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3656 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3557 | |
| 1 | 99 | 2.7% |
totChol
Real number (ℝ)
| Distinct | 241 |
|---|---|
| Distinct (%) | 6.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 236.87309 |
| Minimum | 113 |
|---|---|
| Maximum | 600 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 57.1 KiB |
Quantile statistics
| Minimum | 113 |
|---|---|
| 5-th percentile | 170 |
| Q1 | 206 |
| median | 234 |
| Q3 | 263.25 |
| 95-th percentile | 312 |
| Maximum | 600 |
| Range | 487 |
| Interquartile range (IQR) | 57.25 |
Descriptive statistics
| Standard deviation | 44.096223 |
|---|---|
| Coefficient of variation (CV) | 0.1861597 |
| Kurtosis | 1.8423574 |
| Mean | 236.87309 |
| Median Absolute Deviation (MAD) | 29 |
| Skewness | 0.6637004 |
| Sum | 866008 |
| Variance | 1944.4769 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 240 | 69 | 1.9% |
| 260 | 58 | 1.6% |
| 220 | 58 | 1.6% |
| 232 | 54 | 1.5% |
| 210 | 51 | 1.4% |
| 230 | 50 | 1.4% |
| 250 | 48 | 1.3% |
| 200 | 48 | 1.3% |
| 225 | 46 | 1.3% |
| 205 | 45 | 1.2% |
| Other values (231) | 3129 |
| Value | Count | Frequency (%) |
| 113 | 1 | < 0.1% |
| 119 | 1 | < 0.1% |
| 124 | 1 | < 0.1% |
| 133 | 1 | < 0.1% |
| 135 | 2 | |
| 137 | 1 | < 0.1% |
| 140 | 2 | |
| 143 | 3 | |
| 144 | 2 | |
| 145 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 600 | 1 | < 0.1% |
| 464 | 1 | < 0.1% |
| 453 | 1 | < 0.1% |
| 439 | 1 | < 0.1% |
| 432 | 1 | < 0.1% |
| 410 | 3 | |
| 405 | 1 | < 0.1% |
| 398 | 1 | < 0.1% |
| 392 | 1 | < 0.1% |
| 391 | 1 | < 0.1% |
sysBP
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 231 |
|---|---|
| Distinct (%) | 6.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 132.36803 |
| Minimum | 83.5 |
|---|---|
| Maximum | 295 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 57.1 KiB |
Quantile statistics
| Minimum | 83.5 |
|---|---|
| 5-th percentile | 104 |
| Q1 | 117 |
| median | 128 |
| Q3 | 144 |
| 95-th percentile | 175 |
| Maximum | 295 |
| Range | 211.5 |
| Interquartile range (IQR) | 27 |
Descriptive statistics
| Standard deviation | 22.092444 |
|---|---|
| Coefficient of variation (CV) | 0.16690167 |
| Kurtosis | 2.2766967 |
| Mean | 132.36803 |
| Median Absolute Deviation (MAD) | 13 |
| Skewness | 1.1636945 |
| Sum | 483937.5 |
| Variance | 488.07608 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 130 | 90 | 2.5% |
| 120 | 87 | 2.4% |
| 110 | 83 | 2.3% |
| 125 | 79 | 2.2% |
| 115 | 75 | 2.1% |
| 124 | 73 | 2.0% |
| 122 | 70 | 1.9% |
| 128 | 68 | 1.9% |
| 116 | 65 | 1.8% |
| 132 | 63 | 1.7% |
| Other values (221) | 2903 |
| Value | Count | Frequency (%) |
| 83.5 | 2 | 0.1% |
| 85 | 1 | < 0.1% |
| 85.5 | 1 | < 0.1% |
| 90 | 2 | 0.1% |
| 92 | 1 | < 0.1% |
| 92.5 | 2 | 0.1% |
| 93 | 2 | 0.1% |
| 93.5 | 1 | < 0.1% |
| 94 | 3 | |
| 95 | 5 |
| Value | Count | Frequency (%) |
| 295 | 1 | |
| 248 | 1 | |
| 244 | 1 | |
| 243 | 1 | |
| 232 | 1 | |
| 230 | 1 | |
| 220 | 2 | |
| 217 | 1 | |
| 215 | 2 | |
| 214 | 1 |
diaBP
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 142 |
|---|---|
| Distinct (%) | 3.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 82.912062 |
| Minimum | 48 |
|---|---|
| Maximum | 142.5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 57.1 KiB |
Quantile statistics
| Minimum | 48 |
|---|---|
| 5-th percentile | 66 |
| Q1 | 75 |
| median | 82 |
| Q3 | 90 |
| 95-th percentile | 105 |
| Maximum | 142.5 |
| Range | 94.5 |
| Interquartile range (IQR) | 15 |
Descriptive statistics
| Standard deviation | 11.974825 |
|---|---|
| Coefficient of variation (CV) | 0.14442802 |
| Kurtosis | 1.2616823 |
| Mean | 82.912062 |
| Median Absolute Deviation (MAD) | 7.5 |
| Skewness | 0.7103882 |
| Sum | 303126.5 |
| Variance | 143.39644 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 80 | 217 | 5.9% |
| 82 | 138 | 3.8% |
| 85 | 119 | 3.3% |
| 70 | 114 | 3.1% |
| 81 | 114 | 3.1% |
| 84 | 104 | 2.8% |
| 78 | 104 | 2.8% |
| 90 | 103 | 2.8% |
| 87 | 97 | 2.7% |
| 86 | 94 | 2.6% |
| Other values (132) | 2452 |
| Value | Count | Frequency (%) |
| 48 | 1 | < 0.1% |
| 51 | 1 | < 0.1% |
| 52 | 2 | 0.1% |
| 53 | 1 | < 0.1% |
| 54 | 1 | < 0.1% |
| 55 | 3 | |
| 56 | 2 | 0.1% |
| 57 | 5 | |
| 57.5 | 2 | 0.1% |
| 58 | 4 |
| Value | Count | Frequency (%) |
| 142.5 | 1 | < 0.1% |
| 140 | 1 | < 0.1% |
| 136 | 1 | < 0.1% |
| 135 | 2 | 0.1% |
| 133 | 2 | 0.1% |
| 132 | 1 | < 0.1% |
| 130 | 5 | |
| 128 | 1 | < 0.1% |
| 127.5 | 1 | < 0.1% |
| 125 | 3 |
BMI
Real number (ℝ)
| Distinct | 1297 |
|---|---|
| Distinct (%) | 35.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 25.784185 |
| Minimum | 15.54 |
|---|---|
| Maximum | 56.8 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 57.1 KiB |
Quantile statistics
| Minimum | 15.54 |
|---|---|
| 5-th percentile | 20.0575 |
| Q1 | 23.08 |
| median | 25.38 |
| Q3 | 28.04 |
| 95-th percentile | 32.6925 |
| Maximum | 56.8 |
| Range | 41.26 |
| Interquartile range (IQR) | 4.96 |
Descriptive statistics
| Standard deviation | 4.0659127 |
|---|---|
| Coefficient of variation (CV) | 0.15769018 |
| Kurtosis | 2.8349407 |
| Mean | 25.784185 |
| Median Absolute Deviation (MAD) | 2.47 |
| Skewness | 0.99937349 |
| Sum | 94266.98 |
| Variance | 16.531646 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 23.48 | 18 | 0.5% |
| 22.54 | 16 | 0.4% |
| 22.91 | 15 | 0.4% |
| 25.09 | 14 | 0.4% |
| 22.19 | 14 | 0.4% |
| 23.1 | 13 | 0.4% |
| 25.23 | 13 | 0.4% |
| 23.09 | 13 | 0.4% |
| 22.73 | 12 | 0.3% |
| 22.9 | 12 | 0.3% |
| Other values (1287) | 3516 |
| Value | Count | Frequency (%) |
| 15.54 | 1 | |
| 15.96 | 1 | |
| 16.48 | 1 | |
| 16.59 | 2 | |
| 16.69 | 1 | |
| 16.71 | 1 | |
| 16.73 | 1 | |
| 16.75 | 1 | |
| 16.87 | 1 | |
| 16.92 | 1 |
| Value | Count | Frequency (%) |
| 56.8 | 1 | |
| 51.28 | 1 | |
| 45.8 | 1 | |
| 44.71 | 1 | |
| 44.55 | 1 | |
| 44.27 | 1 | |
| 44.09 | 1 | |
| 43.69 | 1 | |
| 43.67 | 1 | |
| 43.48 | 1 |
heartRate
Real number (ℝ)
| Distinct | 72 |
|---|---|
| Distinct (%) | 2.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 75.73058 |
| Minimum | 44 |
|---|---|
| Maximum | 143 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 57.1 KiB |
Quantile statistics
| Minimum | 44 |
|---|---|
| 5-th percentile | 60 |
| Q1 | 68 |
| median | 75 |
| Q3 | 82 |
| 95-th percentile | 96.25 |
| Maximum | 143 |
| Range | 99 |
| Interquartile range (IQR) | 14 |
Descriptive statistics
| Standard deviation | 11.982952 |
|---|---|
| Coefficient of variation (CV) | 0.15823135 |
| Kurtosis | 1.0625405 |
| Mean | 75.73058 |
| Median Absolute Deviation (MAD) | 7 |
| Skewness | 0.67098224 |
| Sum | 276871 |
| Variance | 143.59114 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 75 | 507 | 13.9% |
| 80 | 336 | 9.2% |
| 70 | 269 | 7.4% |
| 60 | 207 | 5.7% |
| 85 | 191 | 5.2% |
| 72 | 184 | 5.0% |
| 65 | 175 | 4.8% |
| 90 | 147 | 4.0% |
| 68 | 121 | 3.3% |
| 67 | 86 | 2.4% |
| Other values (62) | 1433 |
| Value | Count | Frequency (%) |
| 44 | 1 | < 0.1% |
| 45 | 2 | 0.1% |
| 46 | 1 | < 0.1% |
| 47 | 1 | < 0.1% |
| 48 | 3 | 0.1% |
| 50 | 21 | |
| 52 | 16 | |
| 53 | 10 | 0.3% |
| 54 | 11 | 0.3% |
| 55 | 32 |
| Value | Count | Frequency (%) |
| 143 | 1 | < 0.1% |
| 140 | 1 | < 0.1% |
| 130 | 1 | < 0.1% |
| 125 | 3 | 0.1% |
| 122 | 2 | 0.1% |
| 120 | 6 | 0.2% |
| 115 | 5 | 0.1% |
| 112 | 2 | 0.1% |
| 110 | 30 | |
| 108 | 5 | 0.1% |
glucose
Real number (ℝ)
HIGH CORRELATION 
| Distinct | 138 |
|---|---|
| Distinct (%) | 3.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 81.856127 |
| Minimum | 40 |
|---|---|
| Maximum | 394 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 57.1 KiB |
Quantile statistics
| Minimum | 40 |
|---|---|
| 5-th percentile | 62 |
| Q1 | 71 |
| median | 78 |
| Q3 | 87 |
| 95-th percentile | 108 |
| Maximum | 394 |
| Range | 354 |
| Interquartile range (IQR) | 16 |
Descriptive statistics
| Standard deviation | 23.910128 |
|---|---|
| Coefficient of variation (CV) | 0.29209943 |
| Kurtosis | 60.097287 |
| Mean | 81.856127 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 6.2802651 |
| Sum | 299266 |
| Variance | 571.69421 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 75 | 180 | 4.9% |
| 77 | 166 | 4.5% |
| 70 | 150 | 4.1% |
| 73 | 146 | 4.0% |
| 83 | 145 | 4.0% |
| 78 | 139 | 3.8% |
| 80 | 136 | 3.7% |
| 74 | 136 | 3.7% |
| 76 | 121 | 3.3% |
| 85 | 119 | 3.3% |
| Other values (128) | 2218 |
| Value | Count | Frequency (%) |
| 40 | 2 | 0.1% |
| 43 | 1 | < 0.1% |
| 44 | 2 | 0.1% |
| 45 | 4 | 0.1% |
| 47 | 3 | 0.1% |
| 50 | 3 | 0.1% |
| 52 | 2 | 0.1% |
| 53 | 5 | 0.1% |
| 54 | 5 | 0.1% |
| 55 | 13 |
| Value | Count | Frequency (%) |
| 394 | 2 | |
| 386 | 1 | |
| 370 | 1 | |
| 368 | 1 | |
| 348 | 1 | |
| 332 | 1 | |
| 325 | 1 | |
| 320 | 1 | |
| 294 | 1 | |
| 292 | 1 |
TenYearCHD
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 57.1 KiB |
| 0 | |
|---|---|
| 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 3656 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 1 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 3099 | |
| 1 | 557 | 15.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 3099 | |
| 1 | 557 | 15.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3099 | |
| 1 | 557 | 15.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3656 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3099 | |
| 1 | 557 | 15.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3656 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 3099 | |
| 1 | 557 | 15.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3656 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3099 | |
| 1 | 557 | 15.2% |
| BMI | BPMeds | TenYearCHD | age | cigsPerDay | currentSmoker | diaBP | diabetes | education | glucose | heartRate | male | prevalentHyp | prevalentStroke | sysBP | totChol | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| BMI | 1.000 | 0.144 | 0.089 | 0.147 | -0.132 | 0.146 | 0.378 | 0.105 | 0.083 | 0.069 | 0.061 | 0.207 | 0.293 | 0.220 | 0.326 | 0.150 |
| BPMeds | 0.144 | 1.000 | 0.085 | 0.144 | 0.023 | 0.048 | 0.224 | 0.041 | 0.000 | 0.088 | 0.077 | 0.048 | 0.261 | 0.101 | 0.306 | 0.090 |
| TenYearCHD | 0.089 | 0.085 | 1.000 | 0.229 | 0.052 | 0.008 | 0.168 | 0.090 | 0.088 | 0.123 | 0.015 | 0.089 | 0.180 | 0.040 | 0.215 | 0.096 |
| age | 0.147 | 0.144 | 0.229 | 1.000 | -0.211 | 0.221 | 0.212 | 0.104 | 0.140 | 0.114 | -0.005 | 0.019 | 0.300 | 0.062 | 0.386 | 0.291 |
| cigsPerDay | -0.132 | 0.023 | 0.052 | -0.211 | 1.000 | 0.848 | -0.093 | 0.000 | 0.043 | -0.086 | 0.066 | 0.338 | 0.108 | 0.000 | -0.116 | -0.042 |
| currentSmoker | 0.146 | 0.048 | 0.008 | 0.221 | 0.848 | 1.000 | 0.116 | 0.037 | 0.059 | 0.078 | 0.070 | 0.206 | 0.106 | 0.030 | 0.126 | 0.041 |
| diaBP | 0.378 | 0.224 | 0.168 | 0.212 | -0.093 | 0.116 | 1.000 | 0.052 | 0.044 | 0.052 | 0.179 | 0.066 | 0.642 | 0.048 | 0.780 | 0.195 |
| diabetes | 0.105 | 0.041 | 0.090 | 0.104 | 0.000 | 0.037 | 0.052 | 1.000 | 0.041 | 0.717 | 0.060 | 0.000 | 0.077 | 0.000 | 0.116 | 0.100 |
| education | 0.083 | 0.000 | 0.088 | 0.140 | 0.043 | 0.059 | 0.044 | 0.041 | 1.000 | 0.028 | 0.046 | 0.136 | 0.085 | 0.014 | 0.070 | 0.020 |
| glucose | 0.069 | 0.088 | 0.123 | 0.114 | -0.086 | 0.078 | 0.052 | 0.717 | 0.028 | 1.000 | 0.099 | 0.000 | 0.087 | 0.030 | 0.120 | 0.033 |
| heartRate | 0.061 | 0.077 | 0.015 | -0.005 | 0.066 | 0.070 | 0.179 | 0.060 | 0.046 | 0.099 | 1.000 | 0.110 | 0.142 | 0.000 | 0.175 | 0.094 |
| male | 0.207 | 0.048 | 0.089 | 0.019 | 0.338 | 0.206 | 0.066 | 0.000 | 0.136 | 0.000 | 0.110 | 1.000 | 0.000 | 0.000 | 0.103 | 0.081 |
| prevalentHyp | 0.293 | 0.261 | 0.180 | 0.300 | 0.108 | 0.106 | 0.642 | 0.077 | 0.085 | 0.087 | 0.142 | 0.000 | 1.000 | 0.060 | 0.716 | 0.160 |
| prevalentStroke | 0.220 | 0.101 | 0.040 | 0.062 | 0.000 | 0.030 | 0.048 | 0.000 | 0.014 | 0.030 | 0.000 | 0.000 | 0.060 | 1.000 | 0.066 | 0.000 |
| sysBP | 0.326 | 0.306 | 0.215 | 0.386 | -0.116 | 0.126 | 0.780 | 0.116 | 0.070 | 0.120 | 0.175 | 0.103 | 0.716 | 0.066 | 1.000 | 0.234 |
| totChol | 0.150 | 0.090 | 0.096 | 0.291 | -0.042 | 0.041 | 0.195 | 0.100 | 0.020 | 0.033 | 0.094 | 0.081 | 0.160 | 0.000 | 0.234 | 1.000 |
| male | age | education | currentSmoker | cigsPerDay | BPMeds | prevalentStroke | prevalentHyp | diabetes | totChol | sysBP | diaBP | BMI | heartRate | glucose | TenYearCHD | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | 39 | 4.0 | 0 | 0.0 | 0.0 | 0 | 0 | 0 | 195.0 | 106.0 | 70.0 | 26.97 | 80.0 | 77.0 | 0 |
| 1 | 0 | 46 | 2.0 | 0 | 0.0 | 0.0 | 0 | 0 | 0 | 250.0 | 121.0 | 81.0 | 28.73 | 95.0 | 76.0 | 0 |
| 2 | 1 | 48 | 1.0 | 1 | 20.0 | 0.0 | 0 | 0 | 0 | 245.0 | 127.5 | 80.0 | 25.34 | 75.0 | 70.0 | 0 |
| 3 | 0 | 61 | 3.0 | 1 | 30.0 | 0.0 | 0 | 1 | 0 | 225.0 | 150.0 | 95.0 | 28.58 | 65.0 | 103.0 | 1 |
| 4 | 0 | 46 | 3.0 | 1 | 23.0 | 0.0 | 0 | 0 | 0 | 285.0 | 130.0 | 84.0 | 23.10 | 85.0 | 85.0 | 0 |
| 5 | 0 | 43 | 2.0 | 0 | 0.0 | 0.0 | 0 | 1 | 0 | 228.0 | 180.0 | 110.0 | 30.30 | 77.0 | 99.0 | 0 |
| 6 | 0 | 63 | 1.0 | 0 | 0.0 | 0.0 | 0 | 0 | 0 | 205.0 | 138.0 | 71.0 | 33.11 | 60.0 | 85.0 | 1 |
| 7 | 0 | 45 | 2.0 | 1 | 20.0 | 0.0 | 0 | 0 | 0 | 313.0 | 100.0 | 71.0 | 21.68 | 79.0 | 78.0 | 0 |
| 8 | 1 | 52 | 1.0 | 0 | 0.0 | 0.0 | 0 | 1 | 0 | 260.0 | 141.5 | 89.0 | 26.36 | 76.0 | 79.0 | 0 |
| 9 | 1 | 43 | 1.0 | 1 | 30.0 | 0.0 | 0 | 1 | 0 | 225.0 | 162.0 | 107.0 | 23.61 | 93.0 | 88.0 | 0 |
| male | age | education | currentSmoker | cigsPerDay | BPMeds | prevalentStroke | prevalentHyp | diabetes | totChol | sysBP | diaBP | BMI | heartRate | glucose | TenYearCHD | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 4224 | 1 | 47 | 2.0 | 1 | 3.0 | 0.0 | 0 | 0 | 0 | 198.0 | 120.0 | 80.0 | 25.23 | 75.0 | 76.0 | 0 |
| 4225 | 1 | 45 | 4.0 | 1 | 43.0 | 0.0 | 0 | 0 | 0 | 216.0 | 137.5 | 85.0 | 24.24 | 83.0 | 105.0 | 0 |
| 4226 | 1 | 58 | 1.0 | 0 | 0.0 | 0.0 | 0 | 0 | 0 | 233.0 | 125.5 | 84.0 | 26.05 | 67.0 | 76.0 | 1 |
| 4227 | 1 | 43 | 4.0 | 1 | 20.0 | 0.0 | 0 | 0 | 0 | 187.0 | 129.5 | 88.0 | 25.62 | 80.0 | 75.0 | 0 |
| 4228 | 0 | 50 | 1.0 | 0 | 0.0 | 0.0 | 0 | 1 | 1 | 260.0 | 190.0 | 130.0 | 43.67 | 85.0 | 260.0 | 0 |
| 4231 | 1 | 58 | 3.0 | 0 | 0.0 | 0.0 | 0 | 1 | 0 | 187.0 | 141.0 | 81.0 | 24.96 | 80.0 | 81.0 | 0 |
| 4232 | 1 | 68 | 1.0 | 0 | 0.0 | 0.0 | 0 | 1 | 0 | 176.0 | 168.0 | 97.0 | 23.14 | 60.0 | 79.0 | 1 |
| 4233 | 1 | 50 | 1.0 | 1 | 1.0 | 0.0 | 0 | 1 | 0 | 313.0 | 179.0 | 92.0 | 25.97 | 66.0 | 86.0 | 1 |
| 4234 | 1 | 51 | 3.0 | 1 | 43.0 | 0.0 | 0 | 0 | 0 | 207.0 | 126.5 | 80.0 | 19.71 | 65.0 | 68.0 | 0 |
| 4237 | 0 | 52 | 2.0 | 0 | 0.0 | 0.0 | 0 | 0 | 0 | 269.0 | 133.5 | 83.0 | 21.47 | 80.0 | 107.0 | 0 |